AITopics

Country: North America > United States (0.28)

Industry:

Information Technology > Services (0.66)
Energy > Oil & Gas > Upstream (0.65)

Technology:

Information Technology > Cloud Computing (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Neural Information Processing SystemsSep-26-2025, 18:32:25 GMT

Data center cooling using model-predictive control

Nevena Lazic, Craig Boutilier, Tyler Lu, Eehern Wong, Binz Roy, MK Ryu, Greg Imwalle

Neural Information Processing Systems http://nips.cc/

controller, machine learning, reinforcement learning, (17 more...)

Country: North America > United States (0.28)

Industry:

Information Technology > Services (0.86)
Energy > Oil & Gas > Upstream (0.65)

Technology:

Information Technology > Cloud Computing (1.00)
Information Technology > Information Management (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Neural Information Processing SystemsOct-7-2024, 04:01:05 GMT

Reviews: Data center cooling using model-predictive control

This paper addresses the problem of temperature and airflow regulation for a large-scale data center and considers how a data-driven, model-based approach using Reinforcement Learning (RL) might improve operational efficiency relative to the existing approach of hand-crafted PID controllers. Existing controllers in large-scale data centers tend to be simple, conservative and hand-tuned to physical equipment layouts and configurations. Safety constraints and a low tolerance for performance degradation and equipment damage impose additional constraints. The authors use model-predictive control (MPC) to learn a linear model of the data center dynamics (a LQ controller) using safe, random exploration, starting with little or no prior knowledge. They then determine the control actions at each time step by optimizing the cost of the model-predicted trajectories, ensuring to re-optimize at each time step.

artificial intelligence, machine learning, model-predictive control, (8 more...)

Industry:

Information Technology > Services (1.00)
Energy > Oil & Gas > Upstream (0.62)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.58)
Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.38)

Baumeister, Fabian, Mack, Lukas, Stueckler, Joerg

Incremental Few-Shot Adaptation for Non-Prehensile Object Manipulation using Parallelizable Physics Simulators

arXiv.org Artificial IntelligenceSep-20-2024

Few-shot adaptation is an important capability for intelligent robots that perform tasks in open-world settings such as everyday environments or flexible production. In this paper, we propose a novel approach for non-prehensile manipulation which iteratively adapts a physics-based dynamics model for model-predictive control. We adapt the parameters of the model incrementally with a few examples of robot-object interactions. This is achieved by sampling-based optimization of the parameters using a parallelizable rigid-body physics simulation as dynamic world model. In turn, the optimized dynamics model can be used for model-predictive control using efficient sampling-based optimization. We evaluate our few-shot adaptation approach in several object pushing experiments in simulation and with a real robot.

artificial intelligence, simulation, trajectory, (18 more...)

2409.13228

Country: Europe > Germany (0.14)

Genre: Research Report > Promising Solution (0.34)

Industry: Energy > Oil & Gas > Upstream (0.56)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Zhang, Yuan, Yang, Shaohui, Ohtsuka, Toshiyuki, Jones, Colin, Boedecker, Joschka

Latent Linear Quadratic Regulator for Robotic Control Tasks

arXiv.org Artificial IntelligenceJul-15-2024

Model predictive control (MPC) has played a more crucial role in various robotic control tasks, but its high computational requirements are concerning, especially for nonlinear dynamical models. This paper presents a $\textbf{la}$tent $\textbf{l}$inear $\textbf{q}$uadratic $\textbf{r}$egulator (LaLQR) that maps the state space into a latent space, on which the dynamical model is linear and the cost function is quadratic, allowing the efficient application of LQR. We jointly learn this alternative system by imitating the original MPC. Experiments show LaLQR's superior efficiency and generalization compared to other baselines.

artificial intelligence, ieee rsj international conference, latent linear quadratic regulator, (9 more...)

2407.11107

Country:

Europe > Germany (0.15)
Asia > Japan (0.15)

Genre: Research Report (0.40)

Industry: Energy > Oil & Gas (0.83)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Bishop, Arun L., Zhang, John Z., Gurumurthy, Swaminathan, Tracy, Kevin, Manchester, Zachary

ReLU-QP: A GPU-Accelerated Quadratic Programming Solver for Model-Predictive Control

arXiv.org Artificial IntelligenceNov-29-2023

We present ReLU-QP, a GPU-accelerated solver for quadratic programs (QPs) that is capable of solving high-dimensional control problems at real-time rates. ReLU-QP is derived by exactly reformulating the Alternating Direction Method of Multipliers (ADMM) algorithm for solving QPs as a deep, weight-tied neural network with rectified linear unit (ReLU) activations. This reformulation enables the deployment of ReLU-QP on GPUs using standard machine-learning toolboxes. We evaluate the performance of ReLU-QP across three model-predictive control (MPC) benchmarks: stabilizing random linear dynamical systems with control limits, balancing an Atlas humanoid robot on a single foot, and tracking whole-body reference trajectories on a quadruped equipped with a six-degree-of-freedom arm. These benchmarks indicate that ReLU-QP is competitive with state-of-the-art CPU-based solvers for small-to-medium-scale problems and offers order-of-magnitude speed improvements for larger-scale problems.

artificial intelligence, machine learning, relu-qp, (19 more...)

2311.18056

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)

Genre: Research Report (0.50)

Industry: Energy > Oil & Gas > Upstream (0.62)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

Alavilli, Anoushka, Nguyen, Khai, Schoedel, Sam, Plancher, Brian, Manchester, Zachary

TinyMPC: Model-Predictive Control on Resource-Constrained Microcontrollers

arXiv.org Artificial IntelligenceOct-25-2023

Model-predictive control (MPC) is a powerful tool for controlling highly dynamic robotic systems subject to complex constraints. However, MPC is computationally demanding, and is often impractical to implement on small, resource-constrained robotic platforms. We present TinyMPC, a high-speed MPC solver with a low memory footprint targeting the microcontrollers common on small robots. Our approach is based on the alternating direction method of multipliers (ADMM) and leverages the structure of the MPC problem for efficiency. We demonstrate TinyMPC both by benchmarking against the state-of-the-art solver OSQP, achieving nearly an order of magnitude speed increase, as well as through hardware experiments on a 27 g quadrotor, demonstrating high-speed trajectory tracking and dynamic obstacle avoidance.

artificial intelligence, model-predictive control, resource-constrained microcontroller, (1 more...)

2310.16985

Genre: Research Report (0.40)

Industry: Energy > Oil & Gas > Upstream (0.60)

Technology: Information Technology > Artificial Intelligence > Robots (0.73)

Ceusters, Glenn, Rodríguez, Román Cantú, García, Alberte Bouso, Franke, Rüdiger, Deconinck, Geert, Helsen, Lieve, Nowé, Ann, Messagie, Maarten, Camargo, Luis Ramirez

Model-predictive control and reinforcement learning in multi-energy system case studies

arXiv.org Artificial IntelligenceApr-20-2021

Model-predictive-control (MPC) offers an optimal control technique to establish and ensure that the total operation cost of multi-energy systems remains at a minimum while fulfilling all system constraints. However, this method presumes an adequate model of the underlying system dynamics, which is prone to modelling errors and is not necessarily adaptive. This has an associated initial and ongoing project-specific engineering cost. In this paper, we present an on- and off-policy multi-objective reinforcement learning (RL) approach, that does not assume a model a priori, benchmarking this against a linear MPC (LMPC - to reflect current practice, though non-linear MPC performs better) - both derived from the general optimal control problem, highlighting their differences and similarities. In a simple multi-energy system (MES) configuration case study, we show that a twin delayed deep deterministic policy gradient (TD3) RL agent offers potential to match and outperform the perfect foresight LMPC benchmark (101.5%). This while the realistic LMPC, i.e. imperfect predictions, only achieves 98%. While in a more complex MES system configuration, the RL agent's performance is generally lower (94.6%), yet still better than the realistic LMPC (88.9%). In both case studies, the RL agents outperformed the realistic LMPC after a training period of 2 years using quarterly interactions with the environment. We conclude that reinforcement learning is a viable optimal control technique for multi-energy systems given adequate constraint handling and pre-training, to avoid unsafe interactions and long training periods, as is proposed in fundamental future work.

renewable energy, survey article, upstream oil & gas, (18 more...)

2104.09785

Country:

Europe > Belgium > Flanders (0.14)
Europe > Netherlands (0.14)
North America > United States > Texas (0.14)
(3 more...)

Genre: Research Report (1.00)

Industry:

Energy > Oil & Gas > Upstream (0.85)
Energy > Oil & Gas > Downstream (0.76)
Energy > Renewable > Geothermal (0.68)
Energy > Renewable > Solar (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Bharadhwaj, Homanga, Xie, Kevin, Shkurti, Florian

Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization

arXiv.org Machine LearningApr-18-2020

Recent works in high-dimensional model-predictive control and model-based reinforcement learning with learned dynamics and reward models have resorted to population-based optimization methods, such as the Cross-Entropy Method (CEM), for planning a sequence of actions. To decide on an action to take, CEM conducts a search for the action sequence with the highest return according to the dynamics model and reward. Action sequences are typically randomly sampled from an unconditional Gaussian distribution and evaluated on the environment. This distribution is iteratively updated towards action sequences with higher returns. However, this planning method can be very inefficient, especially for high-dimensional action spaces. An alternative line of approaches optimize action sequences directly via gradient descent, but are prone to local optima. We propose a method to solve this planning problem by interleaving CEM and gradient descent steps in optimizing the action sequence. Our experiments show faster convergence of the proposed hybrid approach, even for high-dimensional action spaces, avoidance of local minima, and better or equal performance to CEM. Code accompanying the paper is available here 1 .

action sequence, optimization problem, upstream oil & gas, (14 more...)

arXiv.org Machine Learning

2004.08763

Country: North America > Canada > Ontario > Toronto (0.29)

Genre:

Workflow (1.00)
Research Report (1.00)

Industry: Energy > Oil & Gas > Upstream (0.63)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.73)

Neural Information Processing SystemsFeb-14-2020, 13:28:38 GMT

Data center cooling using model-predictive control

Lazic, Nevena, Boutilier, Craig, Lu, Tyler, Wong, Eehern, Roy, Binz, Ryu, MK, Imwalle, Greg

Despite impressive recent advances in reinforcement learning (RL), its deployment in real-world physical systems is often complicated by unexpected events, limited data, and the potential for expensive failures. In this paper, we describe an application of RL "in the wild" to the task of regulating temperatures and airflow inside a large-scale data center (DC). Adopting a data-driven, model-based approach, we demonstrate that an RL agent with little prior knowledge is able to effectively and safely regulate conditions on a server floor after just a few hours of exploration, while improving operational efficiency relative to existing PID controllers. Papers published at the Neural Information Processing Systems Conference.

artificial intelligence, data center, upstream oil & gas, (3 more...)

Industry:

Information Technology > Services (0.68)
Energy > Oil & Gas > Upstream (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)